A Method for Classification based on Association Rules using Ontology in Web Data

نویسندگان

  • Hubert Rajan
  • Julia Punitha Malar Dhas
  • Claudia Marinica
  • Fabrice Guillet
  • Hongyu Zhang
  • Yuan-Fang Li
  • Hee Beng Kuan Tan
  • Marcela X. Ribeiro
  • Agma J. M. Traina
  • Caetano Traina
  • Paulo M. Azevedo-Marques
  • Faten Kharbat
  • Haya Ghalayini
  • Jorge J. Villalon
  • Rafael A. Calvo
  • Yuefeng Li
  • Ning Zhong
چکیده

This paper shows a new method based on association rule mining and ontology for the classification of web pages. This work is pruning of association rules, generated by mining process. The main complexity arises due to the fact that there are various number of text documents that are considered for generating the association rules using the A-priori algorithm. But these rules that were generated are not based on the semantic knowledge. In order to obtain the most accurate rules we gone for the construction of the ontology, based on the domain knowledge. With this domain knowledge we design various operators which are helpful in reducing the rules generated. Thus the various rules that we get are semantically correct with regards to the domain selected. We use the high confidence value based classifier for classifying the given text document to that particular domain. Association rules are mined from this matrix using A-priori algorithm. Based on the high confidence value, a new text document is classified into one of the predefined classes. In general, from association rule mining, a huge amount of association rules are mined. All the association rules generated may not be useful for the classification purpose. So, In order to reduce the irrelevant association rules, we need semantic knowledge. For this purpose, propose new domain specific ontology to overcome this drawback of association rule mining method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of a Combined System Based on Data Mining and Semantic Web for the Diagnosis of Autism

Introduction: Autism is a nervous system disorder, and since there is no direct diagnosis for it, data mining can help diagnose the disease. Ontology as a backbone of the semantic web, a knowledge database with shareability and reusability, can be a confirmation of the correctness of disease diagnosis systems. This study aimed to provide a system for diagnosing autistic children with a combinat...

متن کامل

An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)

Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...

متن کامل

Development of a Combined System Based on Data Mining and Semantic Web for the Diagnosis of Autism

Introduction: Autism is a nervous system disorder, and since there is no direct diagnosis for it, data mining can help diagnose the disease. Ontology as a backbone of the semantic web, a knowledge database with shareability and reusability, can be a confirmation of the correctness of disease diagnosis systems. This study aimed to provide a system for diagnosing autistic children with a combinat...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Optimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining

The Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page is one type of web data, was regarded as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012